Statistical Analysis to Establish the Importance of Information Retrieval Parameters
نویسندگان
چکیده
Search engines are based on models to index documents, match queries and documents and rank documents. Research in Information Retrieval (IR) aims at defining these models and their parameters in order to optimize the results. Using benchmark collections, it has been shown that there is not a best system configuration that works for any query, but rather that performance varies from one query to another. It would be interesting if a meta-system could decide which system configuration should process a new query by learning from the context of previousqueries. This paper reports a deep analysis considering more than 80,000 search engine configurations applied to 100 queries and the corresponding performance. The goal of the analysis is to identify which configuration responds best to a certain type of query. We considered two approaches to define query types: one is post-evaluation, based on query clustering according to the performance measured with Average Precision, while the second approach is pre-evaluation, using query features (including query difficulty predictors) to cluster queries. Globally, we identified two parameters that should be optimized: retrieving model and TrecQueryTags process. One could expect such results as these two parameters are major components of IR process. However our work results in two main conclusions: 1/ based on post-evaluation approach, we found that retrieving model is the most influential parameter for easy queries while TrecQueryTags process is for hard queries; 2/ for pre-evaluation, current query features do not allow to cluster queries to identify differences in the influential parameters.
منابع مشابه
مدیریت اطلاعات سلامت دربخش های اورژانس بیمارستان های تابعه دانشگاه علوم پزشکی اصفهان
Introduction: For emergency patients, the quality of the treatment based on evidences and information, are key factors in their lives or dead or even disabilities for the rest of their lives. Considering this point the importance of information for timely decision making, and introducing appropriate plan for patients, in intensive centers such as emergency centers -in which lives and death of p...
متن کاملAssessing the level of familiarity, use and also the effectiveness of mind maps in the information retrieval process
Background and Aim: Mind map is a full-color illustrated note-taking in which, main idea or subject is situated. The main ideas then branch out from the center, which are linked to the central idea. This is a relatively new topic, and slight research has been conducted to show its effectiveness worldwide. The aim is to examine the effectiveness of mind maps in the information retrieval process....
متن کاملThe Ranking of municipal responsibilities importance and satisfaction in the citizens perspectives (Case study: Tehran Municipality-region 6)
Humans in the course of urban development in order to conduct the affairs of the citizens have attempted to establish a municipal entity and the duties and responsibilities to municipalities are considered. municipal performance analysis can be the best way to ensure proper duties as mayor. this study aimed to evaluate the satisfaction level of citizens the function and importance of the servic...
متن کاملدیداری کردن نتایج جستوجو در فرایند بازیابی اطلاعات
Purpose: One of the most effective ways to achieve optimum information retrieval is through visualization of Information. Search strategies, probing skills, querying of information needs and analysis of information play a significant role in the accessing of necessary and useful information. Besides the factors mentioned above, information visualization can increase the availability level of in...
متن کاملAssessing the Internal Structure of the Ellis Information Retrieval Model in Order to Present the Persian Norm of Web Retrieval Tools
Introduction: Study evaluated the internal structure of Ellis information seeking model in the student community with the aim of presenting the Persian norm. Methods: This is a descriptive-analytical study conducted by cross-sectional survey method in the second semester of the academic year 1399-1400. Population comprise of 280 graduate students at Ahvaz Jundishapur University of Medical Scien...
متن کاملThe Effect of Term Importance Degree on Text Retrieval
Various approaches to index term-weighting have been investigated. In fact, term-weighting is an indispensable process for document ranking in most retrieval systems. As well actual information retrieval systems have to deal with explosive growth of documents of various sizes and terms of various frequencies because an appropriate term-weighting scheme has a crucial impact on the overall perfor...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- J. UCS
دوره 21 شماره
صفحات -
تاریخ انتشار 2015